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1. Introduction 

Fix an irrational number a. The problem of the distribution of 
the local spacings between the members of the sequence n 2 a mod 1, 
1 < n < N, has received attention recently (see [0, 0, Q). It arises 
for example in the study of the local spacing distributions between the 
eigenvalues of special Hamiltonians. We order the above numbers in 
[0, 1) as 

(1.1) 0</3 1 </3 2 <---</3 7V <l 

and set f^N+j = Pj- The fc-th consecutive spacing measure is defined to 
be the probability measure on [0, oo) given by 



1 - 

:i-2) fi k (N,a):=-J2$ 



N 

r- 



where 5 X is a unit delta mass at x. The problem is to understand 
the behavior of these measures as N —>■ oo and in particular their 
dependence on the diophantine approximations to a. 

We say a is of type K if there is c a > such that \a — || > c a q~ K 
for all relatively prime integers a and q. It is easy to see that if a is not 
of type 3 then there is a subsequence Nj — > oo for which the measures 
Hk{Nj, a) converge to a measure supported on N = {0, 1, 2, ■ ■ ■ }. On 
the other hand numerical experiments || indicate that for a = V2 
these k-th consecutive spacings behave like what one typically gets 
for spacings when placing numbers in [0, 1] uniformly and inde- 
pendently at random That is /j,k(N,y/2) appears to converge to 



a** := i e xdx - 

Date: February 1, 2008. 

(1) Supported in part by a grant from the U.S. -Israel bi-national Science Foun- 
dation. 

(2) Supported in part by a grant from the U.S. -Israel bi-national Science Foun- 
dation and the NSF. 

1 



2 ZEEV RUDNICK (1) , PETER SARNAK (2) AND ALEXANDRU ZAHARESCU 

A standard approach to the analysis of the consecutive spacing is 
via local m-level correlations. These are defined as follows: As test 
functions we use functions f(x\,X2,--- ,x m ) which are symmetric in 
(xi, X21 ■ ■ ■ ,x m ) and which are functions of the difference of the coor- 
dinates, that is f(x + (t,t, - ■• ,t)) = f(x) for all t 6 R. We assume 
further that / is local, that is it is compactly supported modulo the 
diagonal. We will call these admissible test functions. Define the cor- 
relations 

(1.3) RM(N,a,f):=± £ / (N(P h , ■ ■ ■ , 

l<h<-<j m <N 

Note that is not a probability density and it may well tend to 

infinity as N —>■ oo (think of the case when a is rational). In the case 
that the /3's in fly]) come from a random choice of points in [0,1) these 
correlations satisfy 

(1.4) i2< m >(JV,/)-> I f(0,x 2 ,--- ,x m )dx 2 ---dx m . 

J 0<X2<-..<x m 

We say that n 2 a mod 1, n < N, is Poissonian if for all m > 2 and 
/ as above 

(1.5) R( m \N,aJ)^ ! /(0,x 2 ,--. ,x m )dx 2 ---dx m 
as N oo. 

As with the method of moments in convergence of measures, if the m- 
level correlations are Poissonian then the consecutive spacing measures 
fik{N,a) converge to /i^. Thus Poissonian in the sense of (|1.5| ) (i.e. 
for correlations) implies that as far as local spacings go, the numbers 
behave randomly. We will also consider cases where (|1.5| ) holds along 
a subsequence Nj — > oo, in such a case we say that n 2 a mod 1, 1 < 
n < N is Poissonian on a subsequence. 

The results below lead us to the following 

Conjecture: If a is of type 2 + e for every e > and the convergents - 
to a satisfy lim^oo = 1, where q is the square free part of q, then 
n 2 a mod 1 is Poissonian. 

We note that almost all a (for Lebesgue measure) satisfy the hypoth- 
esis in the Conjecture and that assuming some standard conjectures in 
diophantine analysis any real algebraic irrationality satisfies these hy- 
potheses (see Appendix 0). 
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Unfortunately the methods of this paper appear not to be power- 
ful enough to prove anything for the numbers a in the Conjecture. 
They require that a have somewhat better approximations by ratio- 
nals. One of our main results gives conditions on the diophantine ap- 
proximations to a which ensure that n 2 a mod 1 is Poissonian along a 
subsequence. In particular this allows us to conclude that for the topo- 
logically generic a (i.e. in the sense of Baire) n 2 a mod 1 is Poissonian 
along a subsequence. On the other hand the naive expectation that for 
any irrational a, n 2 a mod 1 is Poissonian along a subsequence, fails 
dramatically. The source of this phenomenon is large square factors 
in the denominator of the convergents to a. We will exhibit an a for 
which the 5-level correlations go to infinity as iV — > oo. We also pro- 
vide an a of type less than three and a sequence of integers {Nj}°? =1 
along which the 5-level correlations diverge to infinity. 

The precise statements are as follows: 

Theorem 1. Let a G R. Suppose there are infinitely many rationals 
bj/qj, with qj prime, satisfying 



a — 



< 



log Nj 



1 for which (FOL) 



Then there is a subsequence Nj — > oo with 1( 

holds for allm > 2 and all f . That is to say n 2 a mod 1 is Poissonian 
along this subsequence. 

With a lot more work concerning the exponential sums discussed in 
Section |9], for general moduli q, we can relax the condition that qj be 
prime in Theorem [l]. In fact we can prove the following (we do not go 
into the proof in this paper) which shows that for such approximants 
the size of the square free parts <jj of qj is decisive. 



Theorem 1': Let a be an irrational for which there are infinitely many 
rationals bj/qj satisfying 



a 



1 

< — • 

q] 



Then the following are equivalent : 

(i) There is a subsequence Nj — > oo with 

mod 1 is Poissonian along Nj. 
(ii) 

log qj 



1 such that n 2 a 



lim 

j—oo log qj 



1 . 
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As to the divergence of correlations we have: 

Theorem 2. (a) There is an irrational a and a test function f such 
that 

R {5) (N,aJ) > m (log iV)~ 5 , as N -> oo . 

(b) For every o > 23/8 there is an a of type a and a test function f 
such that 

Tirn~R (5) (iV,a,/) = oo . 

The test functions / in Theorem |2| are nonnegative and are supported 
in a neighborhood of (modulo the diagonal) and the source of the 
divergence is that there are zero density, but non-negligible, clusters 
among the numbers n 2 a mod 1 , n < N. 

We note that these clusters which spoil the correlations do not have 
the same effect on the probability measures /ifc(iV, a). So it is quite pos- 
sible for example that the a in part (b) of Theorem ^ has its //fc(iV, a) 
measures converge to the Poissonian We have chosen in this paper 
to call n 2 a mod 1 Poissonian if the strongest behavior holds - that is 
the correlations are Poissonian. 

The proofs of the above theorems are based on the following closely 
related diophantine problem: Consider the spacing distributions (nor- 
malized to mean spacing 1 as before) of the numbers {bn 2 /q}, n < N, 
or what is the same, the spacing distribution of the integers 

(1.6) n 2 b mod q, 1 < n < N. 

Here q is prime (q — > oo), b is any number not divisible by q and iV is 
in the range [q 1 ^ 2+e , for some e > 0. The reason for this range for 
iV is that if iV < ^fq and say b = 1 then the spacing distributions may 
be easily determined (since n 2 < q for n < N < Jq) and are certainly 
nonrandom. Similarly if iV = q then the sequence in ( |1.6| ) consists of 
all the quadratic residues (or non-residues) and hence the spacings are 
integers and so cannot follow a Poissonian law. In fact the limiting 
spacing distributions of Hk(q,q,b) were determined by Davenport || 
0. So it is only in the range iV G [g 1 / 2+e , ^—] that we can hope for 
randomization. The following Theorem shows that indeed, to a certain 
extent, this is the case. 

Let R( m \N,b/q, f) denote the scaled m-level correlations for the 
sequence (|1.6|). 

Theorem 3. Fix m > 2 , f and 5 > 0. Then as q —>■ oo, q prime 
R( m \N,b/q,f)^ I /(0,x 2 ,--- } x m )dx 2 ...dx m 

Jo<X2<...<x m 
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uniformly for (b, q) — 1 and q 1 2m + 5 < N < — 



A crucial ingredient in our proof of Theorem |3] is the Riemann Hy- 
pothesis for curves (of arbitrary large genus) over finite fields (Weil 

0). 



In the range in which Theorem |] applies it gives Poisson statistics 
and Theorem |] easily yields Theorem [l|. For m > 3 it is not possible to 
extend the range of N in Theorem |3| much further. The reason is related 
to the previous divergence of correlations phenomenon. For suitable b 
(depending on q) there will be large clusters among the numbers n 2 b 
(mod q) , n < N. This is highlighted by the following Theorem. 

Theorem 4. Fix m > 3 and 5 > 0. Then there is a test function f 
such that for qz < N < q m + 2 ~ s , 

lim max R (m) (N, b/q, /) = oo . 

q^oo (b,q) = l 

Acknowledgment: We would like to thank E. Bombieri for his 
help with the application of the ABC conjecture described in Appen- 
dix [A]. We also thank the referee for suggesting a stronger version of 
Theorem |2] (a) with a simpler proof than our original one. 

2. A COMPARISON LEMMA 

We will need to deal with the following situation: We are given two 
families of sequences Af = {xn(u) : n < N} and Af' = {x' N {n) : N < 
N} in [0, 1) and we wish to compare the limiting correlation functions 
of these two families, seeking to show that if the correlations exist for 
one sequence then they exist for the other, or they diverge for one if 
they do for the other. We show that it can be done if the two sequences 
are close in a suitable sense. We define the scaled distance between the 
sequences to be 

e(Af,M') := N ma.x\xN(n) — x' N (n)\ . 

n<N 

A general method for carrying out the comparison is formalized in the 
following: 

Lemma 5 (Comparison Lemma). Assume thatAf,Af' C [0, 1) are two 

families of sequences with e(Af,Af') — >• as N — » oo. Then for all 
smooth test functions f , we have 

| -R (fc) (Af, f) - -R (fc) (Af', f) | < R ik) (Af,f + )e(Ar,A/') 

for N sufficiently large, where f+>0isa smooth admissible test func- 
tion (depending only on f). 
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Proof. For notational simplicity, we will do the case of pair correlation 
(k = 2). Our test function / 7^ can then be written as f(x\,x 2 ) = 
g(xi — x 2 ) for some g G C£°(R), say g supported inside [— p, p\. Let 
g+ > be smooth, compactly supported and such that g + is constant 
on [— 2p, 2p], where it equals max|g'|. Set f+(xi,x 2 ) : = 2g + (xi — x 2 ). 
For further notational simplicity also set 5 m>n := Xs{rri) — Xn(ti) and 
: = x' N {m)-x' N {n). 
By the mean value theorem we have 

J2« (A/", /) - (A/"', /) = i £ ff (iV5 m , n ) - ^(iVC,J 

l<m<n<iV 
l<m<n<N 

where £ TOi „ lies between <5 min and S' m . 

For the difference i?( 2 ) (/, Af) - R^ (f, Af') to contain a nonzero con- 
tribution from the term indexed by the pair (m, n), we must have at 
least one of N5 m>n or N5' mn lying in suppg C [— p, p). Now iV£ mj „ is 
within 2e{H,Af') of both N5 mtn and N6' mn , which implies that both lie 
in [—2p, 2p], as does £ mjn if N is sufficiently large so that 2e(Af,Af') < p. 
Since g + is constant on [— 2p, 2p] we find that g + (£ mtn ) = g + (N5 mtn ). 

Thus we get 

\R^(Af, f) - R^(Af', f)\ < 1 £ <? + (A^ m , n ) ■ 2e(A r , A/"') 

l<m<n<iV 

= J R( 2 )(AT,/ + )e(Ar,AT') 
as required. □ 

3. Derivation of Theorem [I] 

As an immediate application of the comparison lemma, we derive 
Theorem |T] from Theorem [| 

Fix a. Suppose there are infinitely many rationals bj/qj with qj 
prime, satisfying 




We let Nj = [j^r], where [.] denotes the integer part function. Fix an 
m > 2 and a test function / as above. We need to show that 

lim R^iN^aJ) = / f(0,x 2 ,---,x m )dx 2 ---dx m . 

J0<X2<-<x m 
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By Theorem | applied to q = q 3 , b = bj and N = Nj we know that 



lim R { - m) (N j ,bj/q j , f) = / f(Q,x 2 ,---,x m )dx 2 ---dx m 

Jo<x 2 <-<x m 

We use the comparison principle to estimate the difference: 

\R^(N 3 ,a,f)-R^(N 3 ,b 3 /q 3 ,f)\. 

Take Nj = {{an 2 } : n < Nj} and Nj = {{bjn 2 / qj } : n < Nj}. By 
lemma [!|, 

(3.1) \R {m) (Nj, /) - R {m) (Nj, f) | < rt m \N 3 ,f + )e(N 3 ,N'j) 
for some admissible test function /+ > 0. We have 

h h N 2 1 



and thus 

By Theorem ^, i?( m ) (JV}, /+) is bounded (it converges as j — > oo). Thus 
we use Q3.1D to deduce that 

|itf m >(ty, a, /) - J^CJV,-, /)| - 
which gives Theorem [IJ. □ 

4. A DIVERGENCE PRINCIPLE 

We present a mechanism that ensure divergence of high correlations 
of the sequence {bn 2 /q}: The presence of larges square factors in q. 

Lemma 6. Let q = uv 2 with v > q 5 for some 5 > 0, let r\ > 1 — 8 and 
suppose that log N/ log q > n. Let f > be a positive admissible test 
function which is non-vanishing at the origin. Then for all b, 

(4.1) J2 (m) (tf,-,/)» I/(0)(— • 

q N q 

In particular R^ m \N ,bj 1 q, f) will diverge to infinity for m sufficiently 
large in terms of 5 and n. 

Proof. Write f(x ± , ...,x m )= g(x 1 -x 2 , ■ ■ ■ x m _ x -x m ) for g E C c (R m_1 ), 
g > 0, g(0) ^ 0. Then 

hW(w,»/) = 1 £ ,(..., wA-w^},...) 

o A* o o 

H l< ni <-<n m <N H H 
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Since g > 0, we may count only the contribution of those (ni, . . . , n m ) 
(rij distinct) for which all the components ni, . . . ,n m are divisible by 
uv. There are ^> [N/uv] m = [Nv/q] m such m-tuples. If n = uvn' then 
since q = uv 2 we have 

{^!} = {bu{n'f} = 

and so we find 

R ^)(N,-,f)^h(o)(— r ■ 

q JM q 

Since v > q s and N ^> q v with r] > 1 - 5, this gives R^(N, |, /) > q s 
with 

s = f](m — 1) + m5 — m = m(r] — (1 — 5)) — f] 

which is positive if m > 77 / (77 — (1 — 5)) > 0. Thus for m sufficiently 
large, R^ m \N, b/q, f) will diverge in these ranges. □ 



5. Proof of Theorem f| 

Fix m > 2, some small S > 0, and let N, q be large such that q 1 ^ 3 < 
N < g ™/(™+2)-<5_ Let / > is an admissible test function, /(0) ^ 0, 
and /+ > / the smooth majorant appearing in Lemma B. We want to 
show that there exists b < q coprime to q such that R^ m '(N, b/q, f + ) is 
large. 



We first produce q' which is a square, q' 
that 



coprime to q, such 



(5.1) 



qq x iV 3 (logiV) 3 . 



To do so, find v in the interval 



J 



' .Y :; log 3 iV 2 /iV3 1og 3 iY 



Q 



which is coprime to q. Note that A^ 3 log 3 N/q ^> (logg) 3 since iV > q 1 ^ 3 
and so the existence of such numbers v is assured for any q sufficiently 
large. Indeed, if q is sufficiently large then in any interval [x, 2x] with 
x ^> (logg) 3 / 2 there is a prime I not dividing q, since otherwise q would 
be divisible by all primes in the interval and consequently log q would be 
at least as large as ^2 x<p<2x logp ~ x which contradicts x 3> (logg) 3 / 2 . 
We now put q' = v 2 . Thus (q', q) = 1 and (O) holds. Because 



is a square, we may use the divergence principle (|4~T|) to see 
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that for all h' we have 



Since v = ^fq 1 <C y N 3 log N/q, we find that for all b' 



q' ,JJ jV 3m / 2 (logiV) 3m/2 N m / 2+1 (logN) 3m / 2 ' 
Now use q 1/3 <N< q m /( m + 2 )~ s to find that for some C > 0, 

(5.2) RW(N, -, /) > Cq S(m/2+1 \\ogq)- 3m/2 

q' 

uniformly in b' if q > go- Since 6 > 0, this diverges with g. 

Because q, q' are coprime, there are < b < q, < b 1 < q' so that 
&</ — b'q = 1 and so 

,6_6j 1_ _ 1 

'g ~ g 7 g7 ~ log 3 AT ' 

By the comparison principle (lemma |5]), the two sequences M = 
{{bn 2 /q} :n<N} and W = {{b'n 2 /q'} : n < N} satisfy 

(5.3) \R^(N, -, /) - rt m \N, - f , f)\ < e(Af,Af')R^(N, -, /+) 

q q q 

where /+ is a major ant for /, and in particular nonvanishing at the 
origin. Moreover 

e(Af,Af ) = Nmax\{ — } - {— } < \N 3 

(5.4) "f q \ q q q 

~ (logiV) 3 ^ (logg) 3 

We claim that 

R^)(N,-,f+) > -q S ^ 2+1 \logq)- 3m / 2 
q' 3 

Indeed, assuming otherwise we have from (|5.3j ) and (|5.4j) that 

\R {m) (N, -, f) - R( m \N, -,/)! = o(g 5(m/2+1) (logg)" 3m/2 ) 
q q' 

which together with ([T2|) forces i? (m) (N, b/q, f) > |g 5 ( m / 2 + 1 )(logg)- 3m / 2 . 
However, since / + > / > we find that 

E^fJV, -, /+) > R {m \N, -, /) > -g^/^Qogg)- 3 ™/ 2 
g g 3 

contradicting our assumption. □ 
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6. Preliminaries on continued fractions 

We recall the standard notions of the theory of continued fractions 
(see e.g. |TJ). 

Given integers a G Z, a 1; a 2 , • • • > 1, one defines integers p m , q m by 
the recursion (m > 1): 

Pm O'mPm—l Pm—2 
Qm O'mQm—l Qm—2 

with p_x = 1, p = do, g_i = 0, go = 1- These satisfy the relations 

PmQm—1 Pm—lQm ( 1) 

and 

PmQm-2 ~ Pm-2Qm = ( — l) m O m . 

The finite continued fraction 

[ao; ai, . . . , a m ] := ao H 



1 

ai H 



1 

a 2 H 



1 

'•• + — 

^m 

is then p m /q 

The infinite simple continued fraction [a ; Oi, a 2 , . . . ] is the limit of 
the "convergents" p m /q m - Every irrational a has a unique continued 
fraction expansion. 

The convergents give very good rational approximations to a: We 
have 

< \a < 



2 QmQm+l Qm QmQm+1 

The convergents p m /g m are the "best" rational approximations to 
a, in the following senses: If p/q satisfies \at — p/q\ < l/2q 2 then 
p/q — Pm/q m for some m. Moreover, for m > 1, if < q < q m and 
P/q 7^ Pm/q m then |a -p/q\ > \a-p m /q m \. 

7. Proof of Theorem |2|(a) 

7.1. Constructing a. We want to find an irrational a such that 

/V 1 / 4 

P- 1 ' fl(5)(a - /v » >> (EilvF' 

The construction below is due to the referee, who strengthened and 
considerably simplified our original argument. 
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We construct a by means of its continued fraction expansion, by 
inductively finding do, Oi, . . . , a m so that the denominators q m of the 
convergents are squares: q m = v^- 

To do so, define pairs of integers (r m , v m ) by r_i = v _i = 0, r = 
Vq — 1, ri = V\ — 1 and for m > 1 

(7.2) w m+ i = r m f^ + ti m _i, r m+ i = [log?; m +i] 

Now set ao = 1, and for m > 

a m+l — r m V m + 2r m f m -l 

Let a = [a ; ai, a 2 , . . . ] = [1; 1, 3, 6, ... ]. 

We claim that the denominator q m of convergent to a equals v^. To 
see this, use induction: By the recursion for the convergents, q m+ \ = 
a m +iqm + q m -i and by induction 

2 2 
Qm+l — 0"m+l v m + v m -l 

.2 „,2 



-'m+l 



/ 2 , \2 2 

(r m v m + v m -i) =v v 



as required. 

Note also that from the recursion (177 



q m +i ~ r^gjj, ~ g^(logg m ) 2 

Thus a is of type 3 + e, for all e > 0. 

Now we want to show that _R (5) (a, N) > iV 1/4 /(log N) 5 . Pick m so 
that q m < N < q m+ i. We will replace the sequence of fractional parts 
Af = {{an 2 } : n < N} by a different sequence depending on the size 
of N relative to q m . 

7.2. Case 1: Assume that g™ 3 < N < q m +i- Recall that q m+ i ^> 
g^logg m and so this range is nonempty. Replace Af by the sequence 
Af' = {x' n : n < N} where x' n = {p m +in 2 /q m+ i}. These two sequences 
have asymptotically equal correlations since 



\x n — x„\ < \a \n < -t, 5 — "C 



Qm+i'" " q m +iq m +2 ~ r^ +1 g^ +1 iV(logiV) 2 

since q m +i > N and r m+1 = [logg m+ i] ^> logiV. Thus by the com- 
parison principle (lemma ||p, it suffices to work with the new sequence 
Af'. 

By the divergence principle (see ( |4.1| )), since q m +i = Vm+ii ^ ^ ne test 
function / is nonvanishing at the origin we find that 
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Since q m <C N 3 ^ and r m ~ log iV we find that 

i? (5) (A/', /) > A^ 1 / 4 (bg AT)" 5 
proving (O) when q4 /3 < A^ < g m+ i. 



7.3. Case 2: q m < N < qtt . Set 



3/2 



ATl/2 



Note that since q m < N < qtl z , M lies between N 5 / 8 and N. We 
replace TV by the sequence Af" = {x" t : n < N} where 



|Pm n 2j n <M 
x n = {an 2 }, M <n< N 



To check that correlations of Af and Af" are asymptotically equal, we 
need to see that \x n — x"| = o(l/N). For n > M this certainly holds, 
while for n < M we have 

p m , 2 M 2 



.2 T 



xJ < a 7i < 



9m QrnQm+1 



Now use g m+ i ^> r^g 2 ^ and since q m ~ (M 2 AQ 1//3 and r m 3> logA^, we 
have 

> r 2 m q 3 m > (log A^) 2 M 2 A r 

which gives \x n — x'^\ <C l/N(\ogN) 2 as required. 

Now we study the sequence Af". The number occurs in Af" if 
n < M is a multiple of t> m : n = f m n', since then x" = {— n 2 } = 

{Pm(n') 2 } = 0. Thus occurs as a difference of 5-tuples of elements of 
Af" at least ^> [M/v m ] 5 times. Thus if the origin lies in the support of 
the test function / then 

R {) (Al ,f) > — ( — ) 



N'vJ Nq ^ 
Since M 2 = q^/N, and N < qtl 3 we get 



R®(Af", f) » 4rr, » N 1 ^- 7 ' 2 = N 1 ' 4 
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8. Proof of Theorem |2](b) 

Let a > 23/8. We construct a = [a ; a 1; . . . , a m , . . . ] which will 
be of type a and for which limsup R^(a, N) = oo by an inductive 
construction of the partial quotients a m . 

Suppose we have already found ao, . . . , a m -i, from which we got the 
partial convergents Pj/qj, j = 0, . . . , m — 1. Now take an integer £ ~ 
^m-i^ 2 ) which is coprime to q m -i (this is certainly possible for m 3> 1, 

say take £ a prime between cft^-i 2 anc ^ ^Qm-i^ 2 wn i cn does not divide 
g m -i). Also set v m = £. 

Because £ and q m -i are coprime, there is a unique solution t — a m 
of the congruence 

(8.1) tq m -i + q m -2 = mod £ 2 

which lies in [£ 2 , 2£ 2 ). Then a m ~ £ 2 ~ g^" 2 , and 

:= «m<?m~l + gm-2 ~ 9m-\ ■ 

Thus a is of type a + e for all e > 0. Moreover q m is divisible by by 
(|8l). Thus 

g?rt ^vnVm 

for some integer w m , and 



Now take 



u ~ a (<T ~ 2)/2 ~ (-- 2 )/( 2ct - 2 ) 



a/3 

N qm 



logg m 

We will see that R^ 5 '(a, N m ) — > oo as m — > oo. 

To see this, note that in the sequence {an 2 : n < N m } we may re- 
place a by the partial convergent p m /qm without changing the limiting 
correlations. To see this, note that by Lemma [5] it suffices to check 
that l/q m q m+ i = o(l/N^). Indeed, we have 

1 1 1 



<?m<?m+l q^n 

iV3(logiV m )3 

as required. 

To see that R^ 5 '(N m , — , f) diverges for positive test functions / with 
/(0) 7^ 0, use the divergence principle QL1|) to find 
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Now use N m ~ qm 3 / logg m and v m ~ g™ 2 ^ <y2a 2 ' to find 

/r : (.Y„,.^./) » "»< 



q m ' (log g 



5 

m I 



where 



„ 4(7 a -2 <r(8a-23) 
£ = — + 5- 



3 2a -2 6(<r-l) 
Since a > 23/8, we have -E 1 > which gives divergence of R^ 5 \ □ 

9. Proof of Theorem |] 

Fix m > 2, / and 5 > 0. By approximating f(0,X2,--- ,x m ) 
from above and below with step functions we see that it is enough 
to prove the statement for a function / symmetric, satisfying f(x + 
(t,t,- ■ ■ ,t)) = f(x) for alH e R and such that /(0, x 2 , • ■ • , x m ) is the 
characteristic function of a nice compact set I C R m_1 . In other words, 
given such an / and m, 5 as above, it is enough to show that as q — ► oo 
one has 

(9.1) R (m \N,b/q,I) -+Vol{I) 

uniformly for (b, q) = 1 and g 1 " 2^ +<5 < iV < where iVi^ m ) (iV, 6/?, I) 
is the number of tuples (xi, ■ ■ ■ , x m ) with distinct components x±, ■ ■ ■ ,x m 
in {1, • • • , N} such that 

q q q q 

Given a large prime number q and 6, iV as above, we write R( m \N, b/q,I) 
in the form 



where s = is the dilate factor, and 

u(N, a) = #{1 < Xi < N : 6a;? - bx 2 i+1 = a t (mod g), 1 < % < m- 1} . 

Here m eans the summation is over the vectors a for which the 
partial sums A4 = 2~2k>i a k> A m = 0, are distinct, a condition which 
comes from the requirement that the m-tuples ) to be 

counted in R^(N,b/q, I) have distinct components. Let 



h s (x) 



b[x\ — xf +l ) = dj (mod q), i — 1, • • • , m — 1 
else. 
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Thus: 

v(N,c£)= h ^)' 

l<xi,"- ,x m <N 

We now use the Fourier expansion: 

m 

v(N,a)= H^HMri) 

r (mod q) i=l 

where 

y (mod q) 

and: 

These last sums are geometric series which can be bounded by: 

(9.2) \\F N (r t )\\ «min{iV,^-} 

N 

where the residues Vi are assumed to lie in the interval |]. 
R^ m \N,b/q,I) = -J2 Ur)U F ^ 

aesl r (mod <j) i=l 

we isolate the contribution of r = to get the main term : 

(9.3) R (m \N,b/q,I) = M + S 
with 

* 

(9.4) M = N m ~ l h M 
and 

(9-5) £ = - y, nwE^- 

O^r (mod g) i=l a€s/ 

We first estimate the main term. For any a let C(a, q) be the 
mod q given by the system of congruences: 

bx\ — bx\ = ai (mod q) 
bx m-i ~ bx m = a m-i (mod q). 
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One has ha(0) = -^is(a,q), where v(a,q) is the number of points on 
the curve C(a,q). Thus 

Arm— l * 

(9.6) M = — '£ V (a,q). 

a£sl 

We want to show that as q — > oo one has: 

(9.7) M = Vol(I) + o(l). 

For any a = (ai, • • • , o m _i) denote by r e ff(a, q) the number of distinct 
?/j satisfying the following system: 

(9.8) yi - y i+ i = a { ( mod g), 1 < % < m - 1. 
Since the solutions of the homogeneous system 

Z/i - = 0( mod q), 1 < z < m - 1 

are spanned by (1, • • • ,1), r e ff(a,q) is well-defined (independent of 
the particular solution y of ( |9.8| )). Using the Riemann Hypothesis for 
curves over finite fields (Weil [|15|]) one obtains (see also Hl2| , Proposition 
4): 

(9.9) v{a, q) = 2 m - J V/K<?)( (? + B{a, q)) 
with 

(9.10) \B(S,q)\ < m gi 
We define roots crjj(a),l < i < j < m by 

j'-i 

(9.11) aij(d) = 2^ ak 

k=i 



so that cr iji+ i(a) = a*, o^- = £^ =i ^fc,fc+i- We set D(a) = rii<i<j<m ° 
The solutions of ( |9.8| ) are all distinct (i.e. r e //(a, q) = m) if and only if q 

does not divide D(a), since j/;-^ = Vk-Vk+i = J2k=] a k = a ij(a)- 
Note that .D(a) is a nonzero integer for any a which appears in the 
above summations ^* es/ . In our case q does not divide D(a), since for 
N large enough in terms of I each factor <7ij(a) of D{a) is in absolute 
value smaller than q. Therefore r e ff(a,q) = m and (|9~9| ) and ( |9.10| ) 
give 



(9.12) v(a,q) = q + O m (q 1 *) 

for all a which appear in ( |9.6| ) . Then (|9.6| ) implies that 
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Arm— 1 * 1 1 * 

(9.13) M = — (g + O m (<P )) E 1 = -^li 1 + °™(-)) E L 

a£sl " a&sl 

The number of integer points a G si which lie in the union of the 
hyper-planes (Jij(a) = is O m j(s m ~ 2 ), while by the Lipschitz principle 
(see Davenport ||) it follows that: 

#(sJ n z™- 1 ) = s m - 1 \/o/(J) + o mJ {s m - 2 ). 

Therefore: 

* 

(9.14) E 1 = n Zm_1 ) ~ G s/ : = °) 

= s m - 1 Vol(I) + O mJ (s m - 2 ) 
and from fl9.13|) we get 

A* = (i + m (4=))(i + O m ,/(i)) 

which proves fl9.7|). 

We now proceed to estimate the remainder £. For any a and r we 
have: 



H y&C{a,q) ^ 

Applying Weil's Riemann Hypothesis for curves over finite fields one 
has (see ||, Theorem 6) 

(9.15) j E e (-—)\<mVv~ 

unless the linear form r ■ y is constant along the curve. For a as in 
(9.5) this only happens if r = 0. For, let f ^ be such that r ■ y is 
constant along the curve. Then, in the function field F q (Yi, • • • , Y m ) of 
the curve, where F q denotes the algebraic closure of F q = Z/qZ, Yi is 
a variable and Y 2 , - ■ ■ , Y m are algebraic functions such that 

for 2 < z < m, we will have an equality r ■ Y = c, with c G F ? . 
If we choose j G {1, • • • ,m} such that r J0 7^ then Yj will lie in 
F g (Ti, • • • , Yjo-i, F io +i> • ■ ■ , y m ) and hence 

(9.16) F q (Y u ■ ■ - , Y m ) = F q (Y l7 • ■ ■ , Y jo ^, Y jo+h ■ ■ - , Y m ). 
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Now for any unique factorization domain D of characteristic 7^ 2 and 
any distinct primes p%, ■ ■ ■ ,pt in D one has 

mvpi,--- ,vpt)--K} = 2 t 

where K denotes the quotient field of D (see Besicovitch 0)). Applying 
this with D = FjYi] and p { = Y? - ai+ ' b +a ' for 1 < i < m - 1 we get: 

[F 9 (y 1 ,---,y m ):F g (Y 1 )] = 2 m - 1 . 

By the same argument we see that 

[F g (Y"l, ■ ■ • , Yj -1, ^io+l; ' ' " ) ^m) : Fq(Y"i)] = 2 m 2 

which contradicts ( |9 . 1 . It follows that for all r and a which appear 
in ( p.5| ), the inequality ( |9.15| ) holds true and one has: 



\ha{r)\ < 



m 1 • 

q m -2 



This implies that 



(9.17) 



1 



AY' 



£ (ni^wi)E 1 



O^r (mod q) i=l 

We use ([O]) and (^TJ) in ( gT7| ) to conclude that 



-,m—l 



(9.18) 



If I < 



" r (mod g) i=l 

g m ~"§log m g /logg\ m 



< 



Q 



The theorem now follows from (|9.3|), (|9.7p and (|9.18|). 



Appendix A. Square factors of rational approximants 

Let a be a real number and a n /q n a sequence of rational approxi- 
mants of a: \a — a n /q n \ < l/q' 2 , and q n — > 00. In view of Theorem 1', 
we want to investigate the square parts of the denominators q n , keep- 
ing in mind that large square parts rule out Poisson statistics for the 
correlation functions. 

Definition A.l. A sequence {q n } is almost square-free if Ve > 0, all 
square divisors s 2 of q n satisfy s n <C e q € n - 
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A.l. A metric result. We will show that for almost all a, we have: If 
dn/Qn is a sequence of rational approximants of a (that is \a — a n /q n \ < 
1/q 2 , and q n — > oo), then {q n } is almost square-free. 

In fact, we show more: For an integer q > 1, we write q = qs 2 
with q square-free. Let T be the set of integers q whose largest square 
factor s 2 satisfies s < log 2 q. We will show that almost all reals a have 
rational approximants whose denominators are in T except for finitely 
many exceptions. 

Proposition 7. For all reals a outside a set of measure zero, there is 
a Q = Q(a) > 1 so that if \a — a/q\ < 1/q 2 and q > Q then q £ T . 

The proof of this follows from a well-known general principle: Given a 
sequence of integers Af, we say that a real number a is A/"-approximable 
if there are infinitely many rationals a/q ^ a with denominator q £ M 
and \a — ajq\ < 1/q 2 . For instance, we may take as M the complement 
of T . To prove Proposition |7|, we will use 

Lemma 8. Suppose that M is a sequence such that 



< oo. 

Then the set of H -approximable reals has measure zero. 

Proof. Without loss of generality we will assume that < a < 1. For 
each pair of coprime integers (a, q) with 1 < a < q, denote by I a ^ q the 
interval 

a la 1 
q q z q q z 

Then a is A/"-approximable if and only if it lies in infinitely many of 
the intervals l aA with q £ M . That is for all N > 1, a lies in 

M N \= Un< q( zN Ul< a < g I a ,q- 

Thus we need to compute the measure of M := n^>iM N . Since M N I) 
Mjv+i ^ . . . , we have 

I i 
meas(M) = limmeas(Mjv) < lim meas(/ ai(? ) -C lim - 

A N N<q€Af a=l N N<q£Af $ 

(allowing overlap of the intervals). Since E^ geAr l/g < oo, the above 
limit is zero. □ 

Thus to prove Proposition |7|, it suffices to show 

< oo . 



E 1 
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We rewrite this sum by grouping together those q with the same square- 
free kernel q: Writing q = fm 2 , q — f, then 

9^ / square-free <?=/. / square-free g=/m 2 £F 
Now if g ^ J 7 , q — / then m > log 2 /. Thus for each /, 



£ V £ - 



1 



9=K^ m>log 2 / & ^ 



and so 



Ei 



1 1 

<<: 2^ 7732— < 00 



f square- tree 

as required. □ 

A. 2. Algebraic a. For real algebraic a, the analogue of Proposition ^ 
follows from a standard belief in diophantine analysis, namely the 
'ABC Conjecture" of Masser and Oesterle: Define the radical of an 
integer N as the product of all primes dividing it: rad(A) := n p |ArP- 
The ABC conjecture is the assertion that whenever we have an equation 
in coprime integers A + B + C = 0, then 



(A.l) \A\ < e rad(ABC) 



l+e 



for all e > 0. This implies a seemingly stronger statement: Suppose 
that G(x,y) G Z[x,y] is a homogeneous form with integer coefficients 
and no repeated factors, and m, n coprime integers. Then for all e > 

(A.2) max(|m|, |n|) deg(G) - 2 - £ < e rad(G(m, n)) , 



where deg(G) is the degree of G. The deduction of (|A.2| ) from ( [A. 1 
and a theorem of Belyi [|lj] was noted by Elkies [[J and by Langevin 
fll3| . The ABC-conjecture (|A.l ) is the special case of the ternary form 



G(x,y) = xy(x + y). 

The corollary (|A.2|) of the ABC conjecture implies the analogue of 
Proposition ^ for irrational algebraic a. Indeed, let f(x) be the minimal 
polynomial of a, of degree d > 1, and write f(x/y) = F(x,y)/y d 
with F(x,y) G Z[x,y}. Suppose that p/q is an approximant of a: 
\a—p/q\ < l/q 2 , withp, q coprime. Since f(x) is irreducible, f'(a) ^ 
and thus by the mean value theorem, for some £ between a and p/q, 

l/(-)l = l/(-)-/(«)l = l«-^llf(0l«4 

q q q q 
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On the other hand, 



p F(p,q) 



q <r 

and so we find 

\F(p,q)\ «g d - 2 . 

By (|A.2| ), taking G(x,y) = xyF(x,y) and noting that \p\ <C q, we 
get for all e > 

q d ~ € < e rad(pqF(p, q)) < \pF(p, q)\ rad(g) < q d ~ l rad(g) . 

Thus if q = qs 2 then 

(qs 2 ) d ~ e < £ rad(gs) < qs 

and so s <C e g e . 
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